# FP8 Efficient Inference
Qwen3 235B A22B FP8
Apache-2.0
Qwen3 is the latest version of the Tongyi Qianwen series of large language models, offering a complete suite of dense models and Mixture of Experts (MoE) models. Based on large-scale training, Qwen3 achieves breakthrough progress in reasoning, instruction following, agent capabilities, and multilingual support.
Large Language Model
Transformers

Q
Qwen
47.30k
68
Qwen3 14B FP8
Apache-2.0
Qwen3 is the latest version of the Tongyi Qianwen series of large language models, offering a full range of dense models and mixture-of-experts (MoE) models, achieving breakthrough progress in reasoning, instruction following, agent capabilities, and multilingual support.
Large Language Model
Transformers

Q
Qwen
16.28k
19
Qwen3 4B FP8
Apache-2.0
Qwen3-4B-FP8 is the latest large language model in the Qwen series, offering a 4-billion-parameter FP8 quantized version that supports switching between thinking and non-thinking modes, excelling in reasoning, instruction following, and agent capabilities.
Large Language Model
Transformers

Q
Qwen
23.95k
22
Hyvid
MIT
An anime-style adapter based on Tencent's Hunyuan Video model, providing high-quality text-to-video generation capabilities, specifically optimized for anime-style content creation.
Text-to-Video English
H
calcuis
1,392
20
Uncensored Females Flux Fluxdevufv7fp16 Fp8 Flux
Other
FLUX.1-dev is a text-to-image generation model based on the diffusers library, focusing on FP8 floating-point optimization during the development phase, capable of generating realistic and photorealistic images.
Image Generation English
U
John6666
102
8
Nsfw Master Flux Lora Merged With Flux1 Dev Fp16 V10 Fp8 Flux
Other
FLUX.1-dev is an experimental text-to-image generation model focused on photorealistic, realistic-style image generation.
Text-to-Image English
N
John6666
311
7
Featured Recommended AI Models